Repeats and Palindromes: an Overview
نویسندگان
چکیده
With a long text string like DNA, repeats and palindromes are not easily spotted. Yet nding such substrings is important; for instance, repeats in DNA are indicators of certain hereditary disorders and are used as genetic markers. We discuss repeats and then palindromes and then we relate the two. In our discussion of repeats, we rst de ne an exact repeat and then ve de nitions of approximate repeats. We mention algorithms that search a text string for substrings that satisfy these six de nitions. In addition, we categorize the ve approximate repeats in ve di erent ways. When we look at palindromes, we look at Manacher's algorithm to nd the longest exact palindrome in a string and also an algorithm that nds the longest approximate palindrome in compressed data.
منابع مشابه
Improved Upper Bounds on all Maximal $\alpha$-gapped Repeats and Palindromes
We show that the number of all maximal α-gapped repeats and palindromes of a word of length n is at most 3(π/6 + 5/2)αn and 7(π/6 + 1/2)αn − 5n− 1, respectively.
متن کاملIn-Depth Coverage of the Icon Programming Language and Applications Constant Square-Root Palindromes
In the last issue of the Analyst, we described an application, which we have named qirplore, for exploring the space of square-root palindromes — the palindromic parts of the repeats in continuedfraction sequences for square roots [1]. In this article we’ll use qirplore to gather information about constant square-root palindromes — palindromes in which all terms are the same — and then deduce s...
متن کاملThe effect of the length of direct repeats and the presence of palindromes on deletion between directly repeated DNA sequences in bacteriophage T7.
The frequency of genetic deletion between directly repeated DNA sequences in bacteriophage T7 was measured as a function of the length of the direct repeat. The non-essential ligase gene (gene 1.3) of bacteriophage T7 was interrupted with pieces of synthetic DNA bracketed by direct repeats of various lengths. Deletion of these 76 bp long inserts was too low to be measured when the direct repeat...
متن کاملDevelopment of a Webbased Application to Detect Palindromes in Dna Sequences
Detecting palindromes in DNA sequence is a central problem in computational biology. Identifying palindromes could help scientists advance the understanding of genomic instability. DNA sequences containing long adjacent inverted repeats (palindromes) are inherently unstable and are associated with many types of chromosomal rearrangements. In this paper, we present a simple web-base tool to assi...
متن کاملRepeat Sequences and Base Correlations in Human Y Chromosome Palindromes
On the basis of information theory and statistical methods, we use mutual information, ntuple entropy and conditional entropy, combined with biological characteristics, to analyze the long range correlation and short range correlation in human Y chromosome palindromes. The magnitude distribution of the long range correlation which can be reflected by the mutual information is P5>P5a>P5b (P5a an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014